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Abstract 

We study variants of the well-known Collatz graph, by considering the action of the 3n + 1 function on congruence 
classes. For moduli equal to powers of 2, these graphs are shown to be isomorphic to binary De Bruijn graphs. Unlike the 
Collatz graph, these graphs are very structured, and have several interesting properties. We then look at a natural general- 
ization of these finite graphs to the 2-adic integers, and show that the isomorphism between these infinite graphs is exactly 
the conjugacy map previously studied by Bernstein and Lagarias. Finally, we show that for generalizations of the 3n+ 1 
function, we get similar relations with 2-adic and p-adic De Bruijn graphs. 

Keywords: 3n+ 1 problem, Collatz conjecture, De Bruijn graph, shift map, conjugacy, 2-adic integers. 

1 Introduction 

The 3n + 1 or Collatz conjecture is a long-standing open problem in mathematics. Let the 3n + 1 function T be defined on 
the integers by 

T , s /(3n + l)/2 if n is odd, 

T(n) = i . (1) 

I n/2 if n is even. 

The Collatz conjecture states that, starting from any positive integer n, repeated application of the function T will eventually 
produce the number 1, after which it will end in the cycle {1,2}. This conjecture is true if and only if, on the positive integers, 
there are no divergent paths (i.e., lim^c T k (n) < °° for all positive integers n, where T°(n) = n and T k+l (n) = T(T k (n)) for 
k > 0) and there are no other cycles besides the trivial cycle {1,2} (i.e., there are no natural numbers n > 3 with T k (n) = n 
for some k > 1). Though easy to state, this problem seems very hard, if not impossible to solve. 

Because of its simple formulation, researchers from many different branches of mathematics have at one time or another 
encountered this problem and have become fascinated by it. This has lead to hundreds of papers in the last few decades, with 
each researcher using his own area of expertise to shed a new light on this problem. An excellent overview of many of these 
papers was given by Lagarias 1 16 17], while extensive surveys of previous work on this problem can be found in books by 
Lagarias [ 15 1 and Wirsching 1 27 1. 

Three of those branches of mathematics that have been used to study the Collatz conjecture are those of graph theory, 
modular arithmetic and 2-adic integers. This paper aims to show connections between these three approaches. 

We start in Section [2] with studying modular Collatz graphs, i.e., finite graphs that capture the behaviour of the 3n + 1 
function on congruence classes of integers. It turns out that there is an intimate relation to binary De Bruijn graphs when the 
modulus is a power of 2. Letting this modulus grow to infinity, in Section [3] we are led to studying these problems on the 
2-adic integers. This leads to a natural generalization of the binary De Bruijn graphs to the 2-adic integers. In Section|4]we 
look at the structure of this infinite graph, and we try to describe how various Collatz graphs are embedded in it. In Section|5] 
we briefly indicate possible generalizations. 



2 Binary Collatz graphs and binary De Bruijn graphs 

One particular approach to the 3n + 1 problem that caught our attention is using directed graphs to visualize the action, and 
in particular iteration, of the function T. We denote a directed graph by G = (V,E), where V is the set of vertices, and E 
is the set of directed edges. Since we will not be dealing with undirected graphs, we will refer to directed graphs simply as 
graphs. Consider the graph C(N + ) = (V,E) with vertices V = N + and edges £ = {/?—> T(n) : n e V}. This graph is known 
in the literature as the Collatz graph | 2 13 . 24 , 27 1, and allows for a simple visual explanation of the 3n+ 1 problem to a 
broad audience 1261 . 

Since the Collatz graph has infinitely many vertices and looks very chaotic, we introduce a new family of related, finite 
graphs. Given some modulus m, we define the modular Collatz graph with modulus m as the graph G = (V,E) with vertices 
V = {0, 1, . . . , m — 1}, and a directed edge runs from a to b if there exist some numbers a\ , b\ £ Z, with a\ = a (mod m) and 
b\=b (mod m), such that T(a\ ) =b\. For instance, taking m = 3 leads to the graph on three vertices in Figure[T] 

As can be seen from this graph, once we 'leave' the set 37* we will never return, as there are no edges going into the 
vertex 0. The only way not to leave this set is to have T k (n) = (mod 6) for all n, which implies n = 0. Hence proving that 
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Figure 2: The binary Collatz graphs C(2) with modulus 4 (left) and C(3) with modulus 8 (right). 



all positive integers n = ±1 (mod 3) iterate to 1 is sufficient to prove the Collatz conjecture. Although this conclusion may 
seem trivial, and similar but stronger results have been derived by the Monkses 1 23 , 24 1 , it shows that studying these graphs 
may be useful. 

Upon further inspection, it turns out that in general, these graphs do not look particularly nice or structured. But when 
we take the modulus m to be some power of 2, these graphs do have a nice structure. From now on we will therefore focus on 
what we call binary modular Collatz graphs, or simply binary Collatz graphs. We write C(k) for the modular Collatz graph 
with modulus m = 2 k , and we refer to this graph as the binary Collatz graph of dimension k. For convenience, we write 
for forward iteration in the graph C(k), i.e., T^n) is the set of vertices v in the graph C(k) that are connected to n by an edge 
n —¥ v. This relation can be explicitly written as[j 



T (n)= .{(3« + l)/2,(3«+l)/2 + 2 t - 1 } if n is odd, 
" S {n/2,n/2 + 2*- 1 } ifrciseven. 



For k = 2,3 we get the graphs C(2) and C(3) shown in Figure [2] These can also be found in a recent paper of Monks et 
al. l24l Figures 7.1,7.2]. 

Looking at these graphs, we can immediately see a lot of structure. Both graphs have several symmetries, every vertex 
has two incoming and two outgoing edges, and reversing the direction of each edge leads to a graph isomorphic to the original 
graph. In the graph C(2) we have also labeled each edge with the corresponding congruence class modulo 8, e.g., the edge 
from 2 to 3 has a label 6, because the numbers a = 2 (mod 4) satisfying T(a) = 3 (mod 4) are exactly all numbers a = 6 
(mod 8). With this labeling, we can see a connection between C(2) and C(3): the latter can be formed by taking the so-called 
line graph of the former, associating edges in C(2) to vertices in C(3) and connected edges in C(2) to edges in C(3). 

Seeing these beautiful graphs, one may wonder what is known about these graphs, and the title of this paper gives most 
of it away. Given an alphabet £ = {0, 1, ... ,p — 1} of size p and a wordlength k, the p-ary De Bruijn graph of dimension 
k is defined as the graph B(p, k) = (V, E) with vertex set V = T. k , and an edge runs from the word «o a l ' ' ■ a k- 1 to tne w °rd 
bob\ ■ • ' frjfc-l if an d only if a i+\ = bj for i = 0, . . .,k— 2. Thus, an edge runs from one word to another if the last A: — 1 symbols 
of the first word overlap with the first k — 1 symbols of the second word. When p = 2 we also refer to these graphs as binary 
De Bruijn graphs. Besides viewing the vertices as words of a fixed length over some finite alphabet, it is also convenient to 
associate numbers between and p k — 1 to the vertices. For this we identify words bob\ ■ ■ ■b%_\ with numbers bip' '• 
Figure [3] shows the two different labelings of the binary De Bruijn graph 6(2,3) of dimension 3. For the remainder of this 
paper, we will choose to label the vertices with these numbers rather than with words over a finite alphabet. 

We write a p 4 for forward iteration in the p-ary De Bruijn graph of dimension k. In terms of finite words over Z, this 
corresponds to O^C^O^l ' "bk—l) = {^1^2 ' ' m bk—l x '■ x £ £}• F° r now we w i° f° cus on the case V = % when the relation 
0~2,i: can be described in terms of numbers as 

j{(n-l)/2,(n-l)/2 + 2 k - 1 } if«isodd, 
N;j/2,n/2 + 2 } if n is even. 

Since this relation shifts the bits to the left and appends a new bit, we will refer to this relation as the (binary) shift relation. 

De Bruijn graphs are closely related to, and mostly studied for, finding De Bruijn sequences: cyclic sequences of symbols 
over a finite alphabet containing each sequence of length k exactly once as a subsequence. For instance, a De Bruijn sequence 
for k = 3 is given by 00010111(00). These sequences correspond precisely to Hamiltonian paths in De Bruijn graphs. The 

above sequence corresponds to the path starting at 000, and following the path 000 -^>001 AoioA- 101 A-011 \ 111 M 

110 ^5 100, visiting each vertex of the graph exactly once, hence containing each sequence of length 3 as a subsequence 
exactly once. Since De Bruijn graphs are Hamiltonian, such De Bruijn sequences exist for any value of p and k, and can be 
constructed from a Hamiltonian path in a De Bruijn graphQ 



'in this expression, numbers should be calculated modulo 2*, as all vertices correspond to congruence classes modulo 2*. 

2 Due to the exponential size of the De Bruijn graphs, this method of finding De Bruijn sequences is not very efficient. In Sectionf4]we will see a more 
practical method for generating these sequences. 
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Figure 3: The binary De Bruijn graph of dimension 3, with binary (left) and numeric (right) labels on vertices. 




Figure 4: The graphs C(4) (left) and B(2,4) (right), and the corresponding isomorphism <J>4 = ( 1 , 5) (2, 10) (9, 13). 

Comparing Figures [2] and [3] it is clear that the graph 6(2,3) has exactly the same structure as the binary Collatz graph 
C(3). In fact, for any value of k, the graphs C(k) and B(2,k) have exactly the same structure. This can be seen as follows. 
Let us use the notation (x,(«)) for the 0—1 sequence defined by Xj(n) = T'(n) (mod 2), for i e N, as in 1131 . For example, 
since the orbit of = 3 under iterating T is (3,5,8,4,2, 1,2, 1, 2, .. .), the sequence (*;(«)) is this orbit taken modulo 2, i.e., 
(1,1,0,0,0,1,0,1,0,...). Then it is immediate that Xi(T(n)) = T'{T{n)) = T' +l (n) = x i+l (n) (mod 2), i.e., applying T 
to n means shifting the sequence (x,-(n)) one position to the left. The first k terms of (x,-(n)) depend only on the value of 
n (mod 2 k ), and x^ + \{n) and x k+ \(n + 2 k ) always are different, as shown in 1131 Equation (2.9)]. So if we consider the 
(iterated) action of T on a congruence class of some n (mod 2 k ), then we find that the first k elements of (x,(«)) are identical 
for all numbers in this congruence class, and that the first k elements of (x,(r(«))) are those of (xi(n)) shifted by one to the 
left, with at the end added a or 1, both occuring equally often. But this shows exactly that one forward iteration in the 
binary Collatz graph of dimension k can be described as the shift relation on the bit sequences of length k, i.e., as the forward 
iteration on the binary De Bruijn graph of dimension k. 

To summarize our result, we introduce the function : {0, . . . , 2 k — 1} — ► {0, . . . , 2* — 1}, defined by 

fc-l 

**(n)= 2>(n)2'. (4) 

(=0 

Then we have the following statement. 

Theorem 2.1. For any k > 1, the function 4>j is an isomorphism between C(k) and B(2,k). 

For the binary Collatz graphs, we introduced the notation T k to indicate forward iteration in the graph, and for binary De 
Bruijn graphs we wrote <J2,k f° r walking along edges in these graphs. Since 5>£ is an isomorphism between the two graphs, 
we can describe the relation between 2j and 0(2$ via by 

T k = % x oo 2 .k°<i> k , (5) 

where ^ ({a, ft}) = {O^ 1 (a),*^ 1 (ft)}. This is why ® k may be called a (k-dimensional) conjugacy map. 

While the relation between De Bruijn graphs and the Collatz conjecture does not appear in the literature, these bijections 
<i> k have been explicitly studied before by Bernstein and Lagarias 1 3 4 , 13], but in a context different from graph isomor- 
phisms. In 1131 Table 2] these bijections were explicitly given as permutations on numbers between and 2 k — 1, e.g., for 
t = 3we get the permutation <I>3 = (1,5) cf. Figures[2][3] and for k = 4 we get the permutation $4 = (1,5)(2, 10)(9, 13) cf. 
Figure|4] 

Since the graphs C(k) and B(2,k) are isomorphic, they share several properties. As mentioned before, the line graph of a 
A:-dimensional binary Collatz graph is isomorphic to the (k+ 1) -dimensional binary Collatz graph, and the transpose graph 
of C(k), obtained by reversing the direction of each edge, is again isomorphic to C(k). One other remarkable property is that 
after exactly k steps in these graphs, there is exactly one way to end up at any vertex in the graph (including the vertex itself). 
This can be expressed in terms of the adjacency matrix of the graph, A k , by (-A|)y = 1 for all indices i, j. This was previously 
noted by Feix et al. |7|, and Feix and Rouet |8|. Since every vertex has two outgoing edges, it follows that (Af);j = 2 * 
for I > k. In particular, this value does not depend on ( or j. Thus, if we start at any vertex in the graph, and take k or more 
random steps in the graph, we can be anywhere with equal probability. So if we only know the k least significant bits of a 
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number », then we know absolutely nothing about T k {n). This also gives some intuition why the 3n + 1 problem is so hard: 
unless we know exactly what number we started out with, during each iteration we lose one bit of information about the 
resulting number. 

We are ultimately looking for more insight into the 3n + 1 problem, but these graphs are not quite the same as the Collatz 
graph. The vertices are congruence classes rather than numbers, and the graphs are finite. Still, there are relations between 
these binary Collatz graphs and the regular Collatz graph. For instance, if we simply restrict the Collatz graph to the vertex 
set {0, ... ,2 k — 1}, we get a subgraph of C(k). Also, each edge in the Collatz graph corresponds to a sequence of edges in 
the graphs C(k), e.g., the edge 5 — > 8 in the Collatz graph corresponds to the edges 1 — ► in C(l) and C(2), the edge 5^0 
in C(3), and the edge 5 — > 8 in the graphs C(k) for k > 4. By considering the sequence of graphs C(k), and taking only those 
edges that appear in infinitely many of these graphs, we exactly get the Collatz graph on the natural numbers (including 0). 



3 The 2-adic Collatz graph and the 2-adic De Bruijn graph 

While considering finite Collatz graphs on congruence classes leads to nicely structured graphs, we would like to know more 
about the infinite Collatz graph on the natural numbers. Therefore it would be interesting to consider extensions of these 
binary Collatz graphs and binary De Bruijn graphs to larger (infinite) vertex sets, by in some way letting k go to infinity. 
Then vertices may actually correspond to numbers rather than residue classes. 

Although it is not exactly clear what "limj._ s . 00 C(fe)" means, we can naturally find an answer to this question by taking 
a detour along the De Bruijn graphs. First, we can easily extend the concept of De Bruijn graphs to infinite sequences of 
symbols over a finite alphabet. We define the infinite binary De Bruijn graph by the graph G = (V.E), where the vertex set is 
defined by V — {bob\ ■ ■ ■ : bj £ {0, 1}}, and an edge runs from a vertex aoa\ ■■■ to a vertex b§b\ • ■ ■ if and only if i = /?,-, 
for each i £ N. Similar to the previous section, to these infinite sequences we may associate numbers, by considering the 
set of 2-adic integers Z 2 1111 . We will identify sequences £>qZ?i ' ■ ' with 2-adic integers With this labeling of the 

vertices, we will also refer to the infinite binary De Bruijn graph as the 2-adic De Bruijn graph, or #(^2). Note that in this 
graph, each vertex has only one outgoing edge. So while the shift relation ojj u for finite binary De Bruijn graphs, mapping 
vertices m to its set of neighbors, is not a proper function, forward iteration in the 2-adic De Bruijn graph can be seen as a 
proper function from Z2 to Z2. We denote this function by 02, and it satisfies 

, in— 11/2 if n is odd, 
n/2 11 n is even. 

In terms of binary sequences, this function 02 simply removes the first bit of a number and shifts the 'remaining' bits one 
position to the left. This function is therefore known in the literature as the shift map fj|3l 4 ] 1211221 . 

Before going back to the 3n + 1 problem, consider the limit of the isomorphisms <5j., for k — ► °°. Using the definition 
(") = Li=o x i( n )2', we can just let k tend to infinity to obtain <I>(n) = Y.h=o x i( n )^' ■ I n <^2> this is a convergent series, and 
for any neZ2,$(n) is a well-defined 2-adic integer. By investigating <S>~ 1 (BiTh^)) it then becomes clear what "lim£_ Ko C(fc)'' 
should be, as <I> (fi(Z2)) corresponds exactly to the Collatz graph extended to the 2-adic integers. The extension of the 
3/7 + 1 function T to the 2-adic integers was previously studied in 1 1 , 3,4, 12, 14,21 , 22,27;]. We will refer to this graph, with 
vertex set Z2 and edge set {n — > T(n) : n e Z2}, as the 2-adic Collatz graph, and denote it by C(Z2). The following theorem 
is immediate. 

Theorem 3.1. The graphs Ci^i) an d B(Z 2 ) are isomorphic, and the function <t> : Z2 — > Z2, defined by 

*(n) = £>(«)2 ; , (7) 

!=0 

is an isomorphism from C(7i2) to B(%2), 

The function <I> is known in the literature as the conjugacy map [3 ,4, 12 , 13 15 ,22 , 27], and it clearly satisfies 

rE^'odo*. (8) 

For values of n for which the behaviour of iterates of n is completely known, one can easily compute <5(n). For instance, 
4>(1) = 101010... = — 1/3, <I>(5) = — 13/3, and all numbers ending in the cycle {1,2} correspond to rational numbers in the 
2-adic De Bruijn graph with denominator 3. The Collatz conjecture is equivalent to the statement 4>(N+) C ^Z\Z 1131 . 



4 Structure inside the 2-adic Collatz graph 

Let us now further investigate the graph C^L-i). First, the graph C(Z 2 ) is not connected, and contains uncountably many 
components. It contains two types of components: cyclic components, corresponding to cycles of the function T; and 
divergent components, which do not contain a cycle, but extend infinitely far in both directions. 

The subgraph of C(Zy containing all divergent components, which we will denote by C rf "'(Zy, is arguably the least 
interesting part of the graph. Although it contains uncountably many components, they are all pairwise isomorphic, and the 
vertices in each of these graphs are all indistinguishable, in the sense that the forward mapping is an automorphism. Figure|5] 
shows one of these components, containing an irrational 2-adic integer a = 10110- • • , and the corresponding component in 
B(Z 2 ). 

Note that theoretically, it is possible that a divergent component arises from some rational 2-adic integer, i.e., a rational 
number with an odd denominator. It has been conjectured 1131 Periodicity Conjecture] that all rational 2-adic integers are 
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Figure 5: A divergent component in C^L%) with an irrational number a = 10110. . . (left), and the corresponding component in B(1q) with the irrational 
number 3>(a) = jS = 10010. . . (right). Note that the first five bits of /3 follow from 4> 5 (10110) = 10010, or equivalently, 4>s(13) = 9. 



part of a cyclic component, which can be formulated as <&(Z2 fl Q) = Z2 H Q. But so far, we only know for certain that cyclic 
components must arise from rational numbers, i.e., <5(Z2 l~lQ) 3 Z2 HQ (3) Corollary 1]. 

The other part of the graph, formed by the 2-adic integers that end in a cycle, has different properties. The number of 
components is only countably infinite, since we can enumerate all possible cycles to find all components. For instance, there 
are two components with a cycle of length 1 (containing the cycles A and — 1 — ¥ — 1), and there is exactly one component 
with a cycle of length 2, shown in Figure [6] The Collatz conjecture states that this latter component contains the entire 
Collatz graph on the positive integers as a subgraph. 

Writing for the number of components in C(Zy with a cycle of length k, by counting words of length k and computing 
their cyclic orders, it follows that 

2 k =Y,dM d . (9) 

d\k 

Applying Mobius inversion to the above, we obtain a direct formula for Mj. as 

M k = -Tfi{d)2 k ^ = -2" + 0(2"/ 2 ). (10) 

The sequence (Mk)k>\ = (1, 2, 1, 2, 3, 6, ) has been encountered before in the context of the Collatz conjecture 1 14 1, but has 

also been studied in different contexts [28 1 and is known as Moreau's necklace-counting function 1251 . What this function 
M is actually counting is the number of Lyndon words of length k: strings of length k that are inequivalent modulo cyclic 
rotations, and with period equal to k [18]. Somewhat surprisingly, a connection between Lyndon words and De Bruijn 
sequences has been made before in the literature (9) 1 101 : one may obtain a De Bruijn sequence of order k by appropriately 
concatenating Lyndon words of length some divisor d of k. For instance, for k = 4, the Lyndon words of lengths d\k are 
given by 0, 1 , 01 , 0001 , 001 1,0111. By extending these words to words of length 4, we may order them lexicographically as 
0(000), 0001, 0011, 01(01), 0111, 1(111). Concatenating the Lyndon words in this order, we get the sequence 

0|0001|0011|01|0111|1, (11) 

which is indeed a De Bruijn sequence of order 4, corresponding to a Hamiltonian path in the graph S(2,4). This algorithm 
is known in the literature as the FKM algorithm, after its authors Fredricksen, Kessler and Maiorana |[9l llOI . 

Another fascinating property of the cyclic part of the 2-adic Collatz graph, is the fact that there is a one-to-one correspon- 
dence between cycles in C(Z2), and integer cycles of the family of 3n + b functions T^' b > defined by 

fth\. , I (3n + b)/2 if n is odd, 
T m (n) = \\ " . (12) 

In/2 if 11 is even, 

for odd b coprime to 3. More precisely, if there is a rational cycle in C(Zy with all numbers having denominator b, then this 
cycle corresponds exactly to an integer cycle of the 3n + b problem. This follows from the fact that T(n/b) = T^' b \n)/b. 
This correspondence was previously noted by Lagarias 1 14 1. So together, the 3n + b problems on integers are very structured, 
as we know that every cycle (Lyndon word) corresponds to exactly one value b, and finding the b associated to a given 
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Figure 6: The components corresponding to the cycle (01), in C{l*i) (left) and in B(Zy (right). The Collatz conjecture states that the component on the left 
contains all positive integers. 
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Figure 7: The structure inside CC&j) (left) and the corresponding structure inside B{7Li) (right). Solid arrows pointing to the same larger graph imply that 
the large graph can be partitioned into these smaller graphs. Dotted lines correspond to graph inclusions. 



Lyndon word is also not so hard. But when we try to find all cycles associated to a value b, we get stuck. Note that solving 
this problem would solve the Collatz conjecture, and much more. 

Going back to the complete 2-adic Collatz graph C(Z 2 ), we saw that we may divide the graph in two parts: a cyclic part, 
and a divergent part. Furthermore, we know that all cycles must correspond to rational numbers, but we do not know whether 
all rational numbers also correspond to cycles. So theoretically, we have three different types of components: cyclic, rational 
components; divergent, rational components; and divergent, irrational components. We will denote these parts of the graph 
with C CVC (Z 2 HQ), C dh '(Z 2 HQ), and C(Z 2 \Q) respectively. Each of these parts of the graph corresponds to a part of 5(Z 2 ), 
under the map <J>. 

The complete structure of C(Z 2 ) and its image under <!> in 6(Z 2 ) are summarized in Figure|7] The periodicity conjecture 
1 13] states that the middle part, C dlv (7L 2 HQ), is empty. If this is true, then the picture somewhat simplifies, as then we would 
get *C(Z 2 n Q) = B(Z 2 n Q) and *C(Z 2 \ Q) = B(Z 2 \ Q). 

The challenge of the Collatz conjecture now can be seen as a study in more detail of the embedding of C(N+) into 
C(Z 2 ). The idea is that C(N+) is a rather chaotic object, which lies somehow inside the beautifully structured object C(Z 2 ). 
The latter graph is isomorphic via the conjugacy map <I> to S(Z 2 ), which indeed has a rich and well understood structure. 
Similarly other interesting subgraphs of C(Z 2 ) can via <1> be embedded into S(Z 2 ). 



5 Generalizations and the p-adic De Bruijn graphs 
5.1 an + b problems and the 2-adic De Bruijn graph 

In the previous section we briefly mentioned a generalization of the 3n + 1 function T to 3n + b functions r( 3 '*) . A frequently 
considered further generalization of these functions is the family of an + b functions 1 1 3 4 12 14 21 22 27|. For odd a,£> 6 Z, 
we define the an + b function T^ a ' b "> by 

c„ h\ / \ I {an + b) 12 if n is odd, 
T {a ' b >(n) = { v " (13) 
In/2 if n is even. 

For a = 3, we expect that no divergent paths exist, since with equal 'probability', we either multiply a number by 1/2 or 
(roughly) by 3/2. So on average, the numbers increase by a factor \/3/4 < 1. For a > 5, this heuristic suggests a different 
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behaviour. For instance, for the 5n + 1 problem and the ln+\ problem, we expect most paths to diverge 161 1151 . 

Although in this sense, the an + b problems behave differently from the 3n + 1 problem, the story described above related 
to De Bruijn graphs applies to all of these problems as well. Defining C( a >*)(*) similarly as the Collatz graphs C(-) but for 
the an + b function T^ a,b \ and defining x\ (n) analogously to *, («), we obtain the following result. 

Theorem 5.1. For any k > 1, the graphs C^ a ' b \k) and B(2,k) are isomorphic, and the function 3>1 : {0, ... ,2 k — 1} — > 
{0,...,2*-l}, defined by 

<t>^ b \n)= k £j?' b \n)2\ (14) 
i=0 

is an isomorphism between C( a '*)(£) and B(2.k). Furthermore, the graphs C^ a ^>{%2) and B{^Zq) are isomorphic, and the 
an + b conjugacy map <i>( a - b ) ; TL^ — > 1i< defined by 



^"■ b \n) = ^ b \n)2\ (15) 



i=0 

is an isomorphism from C^ a,b >{%2) t° B(^2)> satisfying 

T (a,b) _ ^(.a,b)yl 0(j20 ^(a,b)_ (16) 

For example, for the 5n+ 1 problem and k = 3, we get the isomorphism 9* = (1, 3) (2, 6) (5, 7). 

It is of interest to observe that for any an + b problem, the binary modular graphs that appear are always isomorphic 
to the same binary De Bruijn graphs. Only the labeling of the graphs depends on the choice of a and b. In other words, 
the binary and 2-adic De Bruijn graphs themselves do not contain much information anymore about the generalized Collatz 
problems. The wanted information is contained in the isomorphisms (the labelings of the graph edges, the conjugacy maps), 
which are different for each a, b, and not well understood at all. That these maps really are different can be seen by noticing 
that the conjugacy map for the 3n+ 1 function maps C(N+) into (most probably) one cyclic connected component in B(Zy, 
while for the 5n+ 1 function the image of C' 5 I '(N+) under its conjugacy map ends up in at least 3 cyclic components, but 
(most probably) in infinitely many components, most of which are divergent. While C dn '(Z2 HQ) is conjectured to be empty, 
(C( 5,1 )) rf!V (Z2 fl Q) is conjectured to have infinitely many distinct components. This may be seen as an illustration of the 
profound difficulty of the Collatz conjecture and its siblings. 

Note that for a = 1 and b = —\, the isomorphism cj)' 1 - -1 ) is extremely well understood, since TV' -1 ) = o~2 an d <&( 1,_1 ) = 
id, the identity map. So one could also write 5(2, k) =C( 1 " 1 )(i) andB(Z 2 ) = C^ 1 " 1 ) (Z 2 ). 

5.2 p-ary functions and the p-adic De Bruijn graphs 



A further generalization of the function T was considered in, e.g., |7 8, 19-21 27 1. For some appropriately chosen integers 
aj and £>,-, let / be defined by 

'(aon + bo)/p if n = (mod/?), 

Aa x n + b x )lp \in=\ (mod/?), 

/(«) = \ (17) 

Xa p -\n + bp-\) i 'p if n = p— 1 (mod p). 
For instance, a problem Lagarias attributes to Collatz [13] concerns the function /o defined by 

{2n/3 ifn = (mod 3), 

(4n-l)/3 ifn=l (mod 3), (18) 
(4«+l)/3 ifn = 2 (mod 3). 

Iterating these functions leads to similar behaviour as the an + b problems. In this case, when considering graphs on congru- 
ence classes modulo p k , we find a relation with />-ary and />-adic De Bruijn graphs as follows. 

Theorem 5.2. For any k > 1, the graphs C^' (p,k) and B(p,k) are isomorphic, and the function '■ {0, . . . ,p k — 1} — > 
{0, . . . , pr — 1}, defined by 

H / jw=E4 /) («)/' ! '. 09) 

i=0 



is an isomon 



phism between C^\p,k) and B(p,k). Furthermore, the graphs C^\ld P ) and B^Lp) are isomorphic, and the 



function <S>p : li p — > lj p , defined by 

<t> p f \n) = Y^x { /\n)p\ (20) 
i=0 

is an isomorphism from C^(Ij p ) to B(Z p ), satisfying 

f={(^ ] )- X oa p o^p. (21) 

As an example, Figure [8] shows the 2-dimensional ternary graph corresponding to the function /o in Equation J 1 8b . and 
the corresponding ternary De Bruijn graph. 
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Figure 8: The ternary modular graph of dimension 2 corresponding to the function fo in 118) and the ternary De Bruijn graph of dimension 2, B(3,2). The 
corresponding isomorphism j of order 7 is given by the permutation = (1,4, 5, 7, 3, 2, 6). 
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